Method and system for cache data analysis for enterprise content management systems

ABSTRACT

Content categories are identified based on content provided by a server associated with an enterprise content management system. Using the relationship database, a content category is identified based on content associated with a client associated with a user, the content associated with the client associated with the user including content accessed by the user. A document associated with the content category is identified. A message is sent from the server to the client associated with the user to suggest to the user the document associated with the content category.

CROSS-REFERENCE TO RELATED APPLICATION(S)

This application is a continuation of, and claims a benefit of priorityunder 35 U.S.C. 120 of the filing date of U.S. patent application Ser.No. 16/694,629, filed Nov. 25, 2019, entitled “METHOD AND SYSTEM FORCACHE DATA ANALYSIS FOR ENTERPRISE CONTENT MANAGEMENT SYSTEMS,” issuedas U.S. Pat. No. 10,965,754, which is a continuation of, and claims abenefit of priority under 35 U.S.C. 120 of the filing date of U.S.patent application Ser. No. 16/113,537, filed Aug. 27, 2018, entitled“METHOD AND SYSTEM FOR CACHE DATA ANALYSIS FOR ENTERPRISE CONTENTMANAGEMENT SYSTEMS,” issued as U.S. Pat. No. 10,498,826, which is acontinuation of, and claims a benefit of priority under 35 U.S.C. 120 ofthe filing date of U.S. patent application Ser. No. 14/036,174, filedSep. 25, 2013, issued as U.S. Pat. No. 10,091,292, entitled “METHOD ANDSYSTEM FOR CACHE DATA ANALYSIS FOR ENTERPRISE CONTENT MANAGEMENTSYSTEMS,” the entire contents of which are hereby expressly incorporatedby reference for all purposes.

BACKGROUND

An enterprise content management system provides online access tocontent stored using digital technology, information formerly availableonly on paper, microfilm, or microfiche. An enterprise contentmanagement system generates new metadata about content as the content ischecked in and out. Information about how and when content is used canenable an enterprise content management system to acquire new filtering,routing and search pathways, and retention-rule decisions. An enterprisecontent management system provides access to data about email andinstant messaging, which may be used in business decisions. The audittrail generated by an enterprise content management system enhancesdocument security, and provides metrics to help measure productivity andidentify efficiency. An enterprise content management system providesintegrated solutions for multiple departments and systems, as manydocuments may cross multiple departments and affect multiple processes.For example, imaging, document management, and workflow can be combinedin a customer service department to enable customer service agents tobetter resolve customer inquiries. Likewise, an accounting departmentmay access supplier invoices from an electronic resource managementsystem, access purchase orders from an imaging system, and accesscontracts from a document management system as part of an approvalworkflow. Similarly, an organization may present information via theWorld Wide Web, which requires managing web content.

BRIEF DESCRIPTION OF THE DRAWINGS

Advantages of the subject matter claimed will become apparent to thoseskilled in the art upon reading this description in conjunction with theaccompanying drawings, in which like reference numerals have been usedto designate like elements, and in which:

FIG. 1 is a block diagram illustrating an example hardware device inwhich the subject matter may be implemented;

FIG. 2 illustrates a block diagram of an example system for cache dataanalysis for enterprise content management systems, under an embodiment;and

FIG. 3 is a flowchart that illustrates a method for cache data analysisfor enterprise content management systems, under an embodiment.

DETAILED DESCRIPTION

A typical deployment of an enterprise content management system includesa centralized content server that manages all content, an applicationthat provides a user interface to the content, and several cache serverswhich serve as caching repositories for serving and ingesting contentcloser to users based on their geographies, with some enterprises usingas many as 100 cache servers located all over the world. A user whorequests a document via their geographically-based cache server mayexperience lag time in receiving one document of millions of documentsmanaged by a centralized content server that is located on the otherside of the planet because their geographically-based cache server hasnot already cached that document.

Embodiments herein provide cache data analysis for enterprise contentmanagement systems. A content category is identified based on contentprovided by a cache server associated with an enterprise contentmanagement system. For example, a cache server analyzer analyzes thecontent that a cache server located in Bangalore, India, provides inresponding to user requests, which results in caching documents that arerelated to accounting more often than a provision threshold, such as 25%of the time. A determination is made whether content associated with thecontent category is cached by the cache server. For example, the cacheserver analyzer determines whether all of the accounting documentsmanaged by a content server located in Boston, Mass., are cached by thecache server in Bangalore, India. The content associated with thecontent category is requested by the cache server if the contentassociated with the content category is not cached by the cache server.For example, the cache server analyzer requests any of the accountingdocuments managed by the content server in Boston, Mass., which are notcached by the cache server located in Bangalore, India. The contentassociated with the content category is cached by the cache server. Forexample, the cache server in Bangalore, India caches any of theaccounting documents managed by the content server in Boston, Mass.,which were not already cached by the cache server in Bangalore, India.The cache server pre-caches content likely to be requested by itsgeographically based users before the content is actually requested byits geographically based users, thereby significantly reducing the lagtime experienced by these users when they request the content that theyare likely to request.

Prior to describing the subject matter in detail, an exemplary hardwaredevice in which the subject matter may be implemented shall first bedescribed. Those of ordinary skill in the art will appreciate that theelements illustrated in FIG. 1 may vary depending on the systemimplementation. With reference to FIG. 1, an exemplary system forimplementing the subject matter disclosed herein includes a hardwaredevice 100, including a processing unit 102, memory 104, storage 106,data entry module 108, display adapter 110, communication interface 112,and a bus 114 that couples elements 104-112 to the processing unit 102.

The bus 114 may comprise any type of bus architecture. Examples includea memory bus, a peripheral bus, a local bus, etc. The processing unit102 is an instruction execution machine, apparatus, or device and maycomprise a microprocessor, a digital signal processor, a graphicsprocessing unit, an application specific integrated circuit (ASIC), afield programmable gate array (FPGA), etc. The processing unit 102 maybe configured to execute program instructions stored in memory 104and/or storage 106 and/or received via data entry module 108.

The memory 104 may include read only memory (ROM) 116 and random accessmemory (RAM) 118. Memory 104 may be configured to store programinstructions and data during operation of device 100. In variousembodiments, memory 104 may include any of a variety of memorytechnologies such as static RAM (SRAM) or dynamic RAM (DRAM), includingvariants such as dual data rate synchronous DRAM (DDR SDRAM), errorcorrecting code synchronous DRAM (ECC SDRAM), or RAMBUS DRAM (RDRAM),for example. Memory 104 may also include nonvolatile memory technologiessuch as nonvolatile flash RAM (NVRAM) or ROM. In some embodiments, it iscontemplated that memory 104 may include a combination of technologiessuch as the foregoing, as well as other technologies not specificallymentioned. When the subject matter is implemented in a computer system,a basic input/output system (BIOS) 120, containing the basic routinesthat help to transfer information between elements within the computersystem, such as during start-up, is stored in ROM 116.

The storage 106 may include a flash memory data storage device forreading from and writing to flash memory, a hard disk drive for readingfrom and writing to a hard disk, a magnetic disk drive for reading fromor writing to a removable magnetic disk, and/or an optical disk drivefor reading from or writing to a removable optical disk such as acompact disc (CD) ROM, digital versatile disc (DVD) or other opticalmedia. The drives and their associated computer-readable media providenonvolatile storage of computer readable instructions, data structures,program modules and other data for the hardware device 100.

It is noted that the methods described herein can be embodied inexecutable instructions stored in a computer readable medium for use byor in connection with an instruction execution machine, apparatus, ordevice, such as a computer-based or processor containing machine,apparatus, or device. It will be appreciated by those skilled in the artthat for some embodiments, other types of computer readable media may beused which can store data that is accessible by a computer, such asmagnetic cassettes, flash memory cards, digital video disks, Bernoullicartridges, RAM, ROM, and the like may also be used in the exemplaryoperating environment. As used here, a “computer-readable medium” caninclude one or more of any suitable media for storing the executableinstructions of a computer program in one or more of an electronic,magnetic, optical, and electromagnetic format, such that the instructionexecution machine, system, apparatus, or device can read (or fetch) theinstructions from the computer readable medium and execute theinstructions for carrying out the described methods. A non-exhaustivelist of conventional exemplary computer readable medium includes: aportable computer diskette; a RAM; a ROM; an erasable programmable readonly memory (EPROM or flash memory); optical storage devices, includinga portable CD, a portable DVD, a high-definition DVD (HD-DVD™), aBLU-RAY disc; and the like.

A number of program modules may be stored on the storage 106, ROM 116 orRAM 118, including an operating system 122, one or more applicationsprograms 124, program data 126, and other program modules 128. A usermay enter commands and information into the hardware device 100 throughdata entry module 108. Data entry module 108 may include mechanisms suchas a keyboard, a touch screen, a pointing device, etc. Other externalinput devices (not shown) are connected to the hardware device 100 viaexternal data entry interface 130. By way of example and not limitation,external input devices may include a microphone, joystick, game pad,satellite dish, scanner, or the like. In some embodiments, externalinput devices may include video or audio input devices such as a videocamera, a still camera, etc. Data entry module 108 may be configured toreceive input from one or more users of device 100 and to deliver suchinput to processing unit 102 and/or memory 104 via bus 114.

A display 132 is also connected to the bus 114 via display adapter 110.Display 132 may be configured to display output of device 100 to one ormore users. In some embodiments, a given device such as a touch screen,for example, may function as both data entry module 108 and display 132.External display devices may also be connected to the bus 114 viaexternal display interface 134. Other peripheral output devices, notshown, such as speakers and printers, may be connected to the hardwaredevice 100.

The hardware device 100 may operate in a networked environment usinglogical connections to one or more remote nodes (not shown) viacommunication interface 112. The remote node may be another computer, aserver, a router, a peer device or other common network node, andtypically includes many or all of the elements described above relativeto the hardware device 100. The communication interface 112 mayinterface with a wireless network and/or a wired network. Examples ofwireless networks include, for example, a BLUETOOTH network, a wirelesspersonal area network, a wireless 802.11 local area network (LAN),and/or wireless telephony network (e.g., a cellular, PCS, or GSMnetwork). Examples of wired networks include, for example, a LAN, afiber optic network, a wired personal area network, a telephony network,and/or a wide area network (WAN). Such networking environments arecommonplace in intranets, the Internet, offices, enterprise-widecomputer networks and the like. In some embodiments, communicationinterface 112 may include logic configured to support direct memoryaccess (DMA) transfers between memory 104 and other devices.

In a networked environment, program modules depicted relative to thehardware device 100, or portions thereof, may be stored in a remotestorage device, such as, for example, on a server. It will beappreciated that other hardware and/or software to establish acommunications link between the hardware device 100 and other devicesmay be used.

It should be understood that the arrangement of hardware device 100illustrated in FIG. 1 is but one possible implementation and that otherarrangements are possible. It should also be understood that the varioussystem components (and means) defined by the claims, described below,and illustrated in the various block diagrams represent logicalcomponents that are configured to perform the functionality describedherein. For example, one or more of these system components (and means)can be realized, in whole or in part, by at least some of the componentsillustrated in the arrangement of hardware device 100. In addition,while at least one of these components are implemented at leastpartially as an electronic hardware component, and therefore constitutesa machine, the other components may be implemented in software,hardware, or a combination of software and hardware. More particularly,at least one component defined by the claims is implemented at leastpartially as an electronic hardware component, such as an instructionexecution machine (e.g., a processor-based or processor-containingmachine) and/or as specialized circuits or circuitry (e.g., discretelogic gates interconnected to perform a specialized function), such asthose illustrated in FIG. 1. Other components may be implemented insoftware, hardware, or a combination of software and hardware. Moreover,some or all of these other components may be combined, some may beomitted altogether, and additional components can be added while stillachieving the functionality described herein. Thus, the subject matterdescribed herein can be embodied in many different variations, and allsuch variations are contemplated to be within the scope of what isclaimed.

In the description that follows, the subject matter will be describedwith reference to acts and symbolic representations of operations thatare performed by one or more devices, unless indicated otherwise. Assuch, it will be understood that such acts and operations, which are attimes referred to as being computer-executed, include the manipulationby the processing unit of data in a structured form. This manipulationtransforms the data or maintains it at locations in the memory system ofthe computer, which reconfigures or otherwise alters the operation ofthe device in a manner well understood by those skilled in the art. Thedata structures where data is maintained are physical locations of thememory that have particular properties defined by the format of thedata. However, while the subject matter is being described in theforegoing context, it is not meant to be limiting as those of skill inthe art will appreciate that various of the acts and operationsdescribed hereinafter may also be implemented in hardware.

To facilitate an understanding of the subject matter described below,many aspects are described in terms of sequences of actions. At leastone of these aspects defined by the claims is performed by an electronichardware component. For example, it will be recognized that the variousactions can be performed by specialized circuits or circuitry, byprogram instructions being executed by one or more processors, or by acombination of both. The description herein of any sequence of actionsis not intended to imply that the specific order described forperforming that sequence must be followed. All methods described hereincan be performed in any suitable order unless otherwise indicated hereinor otherwise clearly contradicted by context. In an embodiment, thecomputer system 100 includes one or more methods for cloning enterprisemanagement systems.

In the prior art, a user who requests a document via theirgeographically-based cache server may experience lag time in receivingone document of millions of documents managed by a centralized contentserver that is located on the other side of the planet because theirgeographically-based cache server has not already cached that document.Embodiments herein provide cache data analysis for enterprise contentmanagement systems. A cache server pre-caches content likely to berequested by its geographically based users before the content isactually requested by its geographically based users, therebysignificantly reducing the lag time experienced by these users when theyrequest the content that they are likely to request.

FIG. 2 illustrates a block diagram of a system that implements cachedata analysis for enterprise management systems, under an embodiment. Asshown in FIG. 2, system 200 may illustrate a cloud computing environmentin which data, applications, services, and other resources are storedand delivered through shared data centers and appear as a single pointof access for the users. The system 200 may also represent any othertype of distributed computer network environment in which serverscontrol the storage and distribution of resources and services fordifferent client users.

In an embodiment, the system 200 represents a cloud computing systemthat includes a first client 202, and a second client 204; and a cacheserver 206 and a content server 208 that are provided by a hostingcompany. The clients 202-204 and the servers 206-208 communicate via anetwork 210. The cache server 206 includes a cache server analyzer 212and cached content 214, and the content server 208 accesses systemcontent 216. The cache server analyzer 212 may include a tags extractor218, an analytics engine 220, a relationship mapper 222, and arelationship database 224. The function of the cache server analyzer 212may be divided into fewer elements than the elements 218-224 or moreelements than the elements 218-224. Although the cache server analyzer212 and its elements 218-224 are depicted as residing completely on thecache server 206, the cache server analyzer 212 and its elements 218-224may reside in any combination of completely or partially on the cacheserver 206, the content server 208, and/or other servers that are notdepicted in FIG. 2. Although FIG. 2 depicts only two clients 202-204,two servers 206-208, one network 210, one cache server analyzer 212, andone each of the elements 218-224 for the purposes of presenting anextremely simplified example, the system 200 may include any number ofclients 202-204, servers 206-208, networks 210, cache server analyzers212, and elements 218-224. The clients 202-204 and the servers 206-208may each be substantially similar to the system 100 depicted in FIG. 1.

A cache server analyzer identifies a content category based on contentprovided by a cache server associated with an enterprise contentmanagement system. For example, after the cache server 206 storescontent as the cached content 214, the tag extractor 218 processes thecontent (synchronously or asynchronously) and parses the content intotokens. For example, the tag extractor 218 parses content in a documentinto tokens such as “invoice,” “payment,” “is,” “due,” “upon,” and“receipt,” with the token “invoice” tagged as a keyword. The analyticsengine 220 uploads the tags and tokens, along with some informationabout the cache server 206, the user associated with the first client202 that requested the content, and some metadata to identify thecontent itself. The analytics engine 220 analyzes the tags, tokens,information, and metadata, and feeds this data to the relationshipmapper 222. The relationship mapper 222 creates relationships based onthe tags, tokens, information, and metadata, and stores therelationships, tags, tokens, information, and metadata in therelationship database 224. The relationship mapper 222 also updates thecache server 206 with information about the cached content 214, such asmore than 25% of the cached content 214 includes accounting documents.The relationship database 224 is also accessible to the content server208 located in Boston, Mass. In this example, the cache server 206located in Bangalore, India, responds to user requests by cachingdocuments that are related to accounting more often than a provisionthreshold, such as 25% of the time.

The cache server analyzer determines whether content associated with thecontent category is cached by the cache server. For example, the contentserver 206 in Boston, Mass., has analyzed the content in the systemcontent 216, and thereby identified content categories for documents,such as accounting, customer service, human resources, web design,engineering, and legal. The cache server analyzer 212 determines whetherall of the accounting documents managed by the content server 208 inBoston, Mass., are cached by the cache server 206 in Bangalore, India.

The cache server requests content associated with the content categoryif the content associated with the content category is not cached by thecache server. For example, the cache server analyzer 212 requests any ofthe accounting documents managed by the content server 208 in Boston,Mass. that are not already cached by the cache server 206 in Bangalore,India. The cache server 206 may also communicate with other cacheservers to receive content associated with the content category, whichis parked on the other cache servers but not yet uploaded to the contentserver 208. The cache server 206 may request the likely contentperiodically to ensure that the cached content 214 is the most recentversion of the corresponding documents. Alternatively, or additionally,the cache server 206 may use the cached content 214 to provide arequested document to a requesting user, and then confirm with thecontent server 208 that the document provided to the user is the mostrecent version of the document while the user accesses the document.

The cache server caches the content associated with the contentcategory. For example, the cache server 206 in Bangalore, India cachesany of the accounting documents managed by the content server 208 inBoston, Mass., which were not already cached by the cache server 206 inBangalore, India. The cache server 206 pre-caches content likely to berequested by its geographically based users before the content isactually requested by its geographically based users, therebysignificantly reducing the lag time experienced by these users when theyrequest the content that they are likely to request.

The cache server analyzer may identify a content category based oncontent associated with a client associated with a user. For example,the cache server analyzer 212 analyzes that a user located in Delhi,India, requests documents that are related to accounting more often thanan association threshold, such as 50% of the time. The cache serveranalyzer 212 may create a user profile based on the content categoriesaccessed by a user. For example, the cache server analyzer 212 creates agraph database that maintains the relationship between a user, contentaccessed by the user, other users who access the same content, and othercontent accessed by the other users. The nodes of the graph database canbe the tags created by the tag extractor 218 and the users identified bythe analytics engine 220. The edges in the graph database are therelations between various nodes, such as tags and users. The cacheserver analyzer 212 can also use the tags to determine relationshipsbetween nodes in the graph database, users and other tags.

The cache server analyzer may send a message to a client to suggestcontent associated with the content category. For example, the cacheserver analyzer 212 sends a message to the first client 202 for theaccounting user in Delhi, India, to suggest selecting any of theaccounting documents that were already cached by the cache server 206 inBangalore, India. In another example, the cache server analyzer 212sends a message to the first client 202 for the accounting user inDelhi, India, to suggest selecting any of the accounting documents thatare available by the content server 208 in Boston, Mass., some of whichare yet to be cached by the cache server 206 in Bangalore, India. Thecache server analyzer 212 only suggests content to a user based on theuser's access levels. For example, accounting documents that areavailable only to accounting supervisors are not available to theaccounting user in Delhi, India if this accounting user does not haveaccess rights at the level of an accounting supervisor or higher. Inanother example, the relationship database 224 includes a graph whichindicates that the users Alice, Bob and Charlie have worked on the sameaccounting document, so they have a relationship with each otherreferred to as a “team.” When Alice uses her client to access theenterprise content management system, and looks for the accountingdocuments she previously worked on, the cache server analyzer 212 willalso suggest that she consider accessing the documents worked upon byBob and Charles, as well as other documents which have the tag“accounting.”

The cache server analyzer may send a message to the client to suggestcontent associated with a second content category based on a level ofsimilarity between a user profile, associated with the user, whichidentifies the content category and a second user profile, associatedwith a second user, which identifies the second content category basedon content associated with a second client associated with the seconduser. For example, the cache server analyzer 212 sends a message to thefirst client 202 for the accounting user in Delhi, India, to suggestselecting any of the customer service documents that were already cachedby the cache server 206 in Bangalore, India because the user profile ofthe accounting user is similar in the content accessed to a user profileof another user who accesses customer service documents as well asaccounting documents. The cache server analyzer 212 can use thestructures of the graph database to analyze relations between users andtags so that two similar users, based on their access levels, can besuggested to select content accessed by the other user. The cache server206 may allocate memory for its cached content 214 based on analysis ofcached content categories. For example, 25% of the documents cached bythe cache server 206 in Bangalore, India, are accounting documents and20% of the documents cached are customer service documents. Therefore,the cache server 206 allocates 25% of its memory to pre-cache accountingdocuments and 20% of its memory to pre-cache customer service documents,but the allocation percentages do not have to match the cachingpercentages.

Embodiments herein enable cache data analysis for enterprise contentmanagement systems. The cache server 206 pre-caches content likely to berequested by its geographically based users before the content isactually requested by its geographically based users, therebysignificantly reducing the lag time experienced by these users when theyrequest the content that they are likely to request.

FIG. 3 is a flowchart that illustrates a method of cache data analysisfor enterprise content management systems. Flowchart 300 illustratesmethod acts illustrated as flowchart blocks for certain steps involvedin and/or between the clients 202-204 and/or the servers 206-208 of FIG.2.

A content category is identified based on content provided by a cacheserver associated with an enterprise content management system, act 302.For example, a cache server analyzer 212 analyzes the content that acache server 206 in Bangalore, India, provides in responding to userrequests by caching documents which are related to accounting more oftenthan a provision threshold, such as 25% of the time.

A determination is made whether content associated with a contentcategory is cached by a cache server, act 304. For example, the cacheserver analyzer 212 determines whether all of the accounting documentsmanaged by the content server 208 in Boston, Mass., are cached by thecache server 206 in Bangalore, India. If all of the content associatedwith a content category is cached by a cache server, the method 300restarts at act 302. If some of the content associated with a contentcategory is not cached by a cache server, the method 300 continues toact 306.

Content associated with a content category is requested by a cacheserver if the content associated with the content category is not cachedby the cache server, act 306. For example, the cache server analyzer 212requests any of the accounting documents managed by the content server208 in Boston, Mass., which are not cached by the cache server 206 inBangalore, India.

Content associated with a content category is cached by a cache server,act 308. For example, the cache server 206 in Bangalore, India cachesany of the accounting documents managed by the content server 208 inBoston, Mass., which were not already cached by the cache server 206 inBangalore, India.

A content category is optionally identified based on content associatedwith a client associated with a user, act 310. For example, the cacheserver analyzer 212 analyzes that a user in Delhi, India, requestsdocuments that are related to accounting more often than an associationthreshold, such as 50% of the time.

A message is optionally sent to a client to suggest content associatedwith content category, act 312. For example, the cache server analyzer212 sends a message to the first client 202 for the accounting user inDelhi, India, to suggest selecting any of the accounting documents thatwere already cached by the cache server 206 in Bangalore, India.

A message is optionally sent to a client to suggest content associatedwith a second content category based on a level of similarity between auser profile, associated with a user, which identifies a contentcategory, and a second user profile, associated with a second user,which identifies the second content category based on content associatedwith a second client associated with the second user, act 314. Forexample, the cache server analyzer 212 sends a message to the firstclient 202 for the accounting user in Delhi, India, to suggest selectingany of the customer service documents that were already cached by thecache server 206 located in Bangalore, India, because the user profileof the accounting user is similar in the content accessed to a userprofile of another user who accesses customer service documents as wellas accounting documents.

Although FIG. 3 depicts the acts 302-314 occurring in a specific order,the acts 302-314 may occur in another order. Embodiments herein enablecache data analysis for enterprise content management systems. The cacheserver 206 pre-caches content likely to be requested by itsgeographically based users before the content is actually requested byits geographically based users, thereby significantly reducing the lagtime experienced by these users when they request the content that theyare likely to request.

The use of the terms “a” and “an” and “the” and similar referents in thecontext of describing the subject matter (particularly in the context ofthe following claims) are to be construed to cover both the singular andthe plural, unless otherwise indicated herein or clearly contradicted bycontext. Recitation of ranges of values herein are merely intended toserve as a shorthand method of referring individually to each separatevalue falling within the range, unless otherwise indicated herein, andeach separate value is incorporated into the specification as if it wereindividually recited herein. Furthermore, the foregoing description isfor the purpose of illustration only, and not for the purpose oflimitation, as the scope of protection sought is defined by the claimsas set forth hereinafter together with any equivalents thereof entitledto. The use of any and all examples, or exemplary language (e.g., “suchas”) provided herein, is intended merely to better illustrate thesubject matter, and does not pose a limitation on the scope of thesubject matter unless otherwise claimed. The use of the term “based on”and other like phrases indicating a condition for bringing about aresult, both in the claims and in the written description, is notintended to foreclose any other conditions that bring about that result.No language in the specification should be construed as indicating anynon-claimed element as essential to the practice of the invention asclaimed.

Preferred embodiments are described herein, including the best modeknown to the inventor for carrying out the claimed subject matter. Ofcourse, variations of those preferred embodiments will become apparentto those of ordinary skill in the art upon reading the foregoingdescription. The inventor expects skilled artisans to employ suchvariations as appropriate, and the inventor intends for the claimedsubject matter to be practiced otherwise than as specifically describedherein. Accordingly, this claimed subject matter includes allmodifications and equivalents of the subject matter recited in theclaims appended hereto as permitted by applicable law. Moreover, anycombination of the above-described elements in all possible variationsthereof is encompassed unless otherwise indicated herein or otherwiseclearly contradicted by context.

What is claimed is:
 1. A system for provisioning content, the systemcomprising: a processor; a non-transitory, computer-readable mediumstoring a processor-based application, the processor-based applicationincluding instructions that are executable by the processor to: provide,by a first server associated with an enterprise content managementsystem, documents in response to user requests; identify contentcategories associated with the documents provided by the first server,wherein identifying the content categories comprises parsing thedocuments provided by the first server and extracting tags correspondingto the content categories from the documents provided by the firstserver; store, in a relationship database, the extracted tags, userinformation about users associated with clients that requested thedocuments provided by the first server, and relationships between thetags and the user information; identify, using the tags, the userinformation and the relationships stored in the relationship database, afirst content category based on content associated with a clientassociated with a first user, the content associated with the clientassociated with the first user including first content accessed by thefirst user; identify a first document associated with the first contentcategory, and not part of the first content accessed by the first user;and send a message from the first server to the client associated withthe first user to suggest the first document associated with the firstcontent category to the first user.
 2. The system for provisioningcontent of claim 1, wherein identifying the first document comprisesidentifying the first document based on a user-access level of the firstuser.
 3. The system for provisioning content of claim 1, whereinidentifying the first content category comprises determining that thefirst user requested documents associated with the first contentcategory more than an association threshold.
 4. The system forprovisioning content of claim 1, wherein the processor-based applicationcomprises instructions executable to: maintain a relationship betweenthe first user, the first content accessed by the first user, a seconduser who accessed the first content, and second content accessed by thesecond user; and identify the first document based on the relationship.5. The system for provisioning content of claim 4, wherein identifyingthe first document is based on determining from the relationship thatthe first user and the second user accessed a second document associatedwith the first content category and that the second user accessed thefirst document.
 6. The system for provisioning content of claim 4,wherein identifying the first content category comprises determiningthat the first user and second user accessed content associated with asecond content category and the second user accessed content associatedwith the first content category.
 7. The system for provisioning contentof claim 1, wherein the processor-based application comprisesinstructions executable to: pre-allocate memory at the first server tostore content related to the first content category based on an analysisof the identified content categories; determine that the first serverresponded to the user requests with documents that are related to thefirst content category more often than a provision threshold; based on adetermination that there are additional documents associated with thefirst content category stored by a content server and not stored by thefirst server, request, by the first server, the additional documentsassociated with the first content category from the content server.
 8. Acomputer program product comprising a non-transitory, computer-readablemedium storing a set of the computer-executable instructions, the set ofcomputer-executable instructions comprising instructions for: providing,by a first server associated with an enterprise content managementsystem, documents in response to user requests; identifying contentcategories associated with the documents provided by the first server,wherein identifying the content categories comprises parsing thedocuments provided by the first server and extracting tags correspondingto the content categories from the documents provided by the firstserver; storing, in a relationship database, the extracted tags, userinformation about users associated with clients that requested thedocuments provided by the first server, and relationships between thetags and the user information; identifying, using the tags, the userinformation and the relationships stored in the relationship database, afirst content category based on content associated with a clientassociated with a first user, the content associated with the clientassociated with the first user including first content accessed by thefirst user; identifying a first document associated with the firstcontent category, and not part of the first content accessed by thefirst user; and sending a message from the first server to the clientassociated with the first user to suggest the first document associatedwith the first content category to the first user.
 9. The computerprogram product of claim 8, wherein identifying the first documentcomprises identifying the first document based on a user-access level ofthe first user.
 10. The computer program product of claim 8, whereinidentifying the first content category comprises determining that thefirst user requested documents associated with the first contentcategory more than an association threshold.
 11. The computer programproduct of claim 8, wherein the set of computer executable instructionsfurther comprises instructions for: maintaining a relationship betweenthe first user, the first content accessed by the first user, a seconduser who accessed the first content, and second content accessed by thesecond user; and identifying the first document based on therelationship.
 12. The computer program product of claim 11, whereinidentifying the first document is based on determining from therelationship that the first user and the second user accessed a seconddocument associated with the first content category and that the seconduser accessed the first document.
 13. The computer program product ofclaim 11, wherein identifying the first content category comprisesdetermining that the first user and second user accessed contentassociated with a second content category and the second user accessedcontent associated with the first content category.
 14. The computerprogram product of claim 8, wherein the set of computer-executableinstructions further comprises instructions for: pre-allocating memoryat the first server to store content related to the first contentcategory based on an analysis of the identified content categories;determining that the first server responded to the user requests withdocuments that are related to the first content category more often thana provision threshold; based on a determination that there areadditional documents associated with the first content category storedby a content server and not stored by the first server, requesting, bythe first server, the additional documents associated with the firstcontent category from the content server.
 15. A method for provisioningcontent, the method comprising: providing, by a first server associatedwith an enterprise content management system, documents in response touser requests; identifying content categories associated with thedocuments provided by the first server, wherein identifying the contentcategories comprises parsing the documents provided by the first serverand extracting tags corresponding to the content categories from thedocuments provided by the first server; storing, in a relationshipdatabase, the extracted tags, user and information about usersassociated with clients that requested the documents provided by thefirst server, and relationships between the tags and the userinformation; identifying, using the tags, the user information and therelationships stored in the relationship database, a first contentcategory based on content associated with a client associated with afirst user, the content associated with the client associated with thefirst user including first content accessed by the first user;identifying a first document associated with the first content category,and not part of the first content accessed by the first user; andsending a message from the first server to the client associated withthe first user to suggest the first document associated with the firstcontent category to the first user.
 16. The method of claim 15, whereinidentifying the first document comprises identifying the first documentbased on a user-access level of the first user.
 17. The method of claim15, wherein identifying the first content category comprises determiningthat the first user requested documents associated with the firstcontent category more than an association threshold.
 18. The method ofclaim 15, wherein the set of computer executable instructions furthercomprises instructions for: maintaining a relationship between the firstuser, the first content accessed by the first user, a second user whoaccessed the first content, and second content accessed by the seconduser; and identifying the first document based on the relationship. 19.The method of claim 18, wherein identifying the first document is basedon determining from the relationship that the first user and the seconduser accessed a second document associated with the first contentcategory and that the second user accessed the first document.
 20. Themethod of claim 18, wherein identifying the first content categorycomprises determining that the first user and second user accessedcontent associated with a second content category and the second useraccessed content associated with the first content category.
 21. Themethod of claim 15, wherein the set of computer-executable instructionsfurther comprises instructions for: pre-allocating memory at the firstserver to store content related to the first content category based onan analysis of the identified content categories; determining that thefirst server responded to the user requests with documents that arerelated to the first content category more often than a provisionthreshold; based on a determination that there are additional documentsassociated with the first content category stored by a content serverand not stored by the first server, requesting, by the first server, theadditional documents associated with the first content category from thecontent server.